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Figure 1(a) 
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MTPRSR-LATLGTVILLVCFCAG- -AAHSRGDTFQ- - 

MM I TNRTRRL LRA WW 1 1 A I GT A VG - - ENVTTPKGAT - - 

MY PTVKS MRVAHLTNLLTLLCLLCHTHL YVCQPTTLR - - 

YYKTILFFALIKVCSFNQTTTHSTTTSPSISSTTSSTTTSTSKPSNTTSTNSSLAASPQ 

MTLNR CVLLIVLTFSTACS Q~ " 

MVPNK HLLLIILSFSTACG Q- - 

MGVGGGPRWLCLWCVAALLCQGVAQEWAETTTPFA- - 

MVAWFGLWGFARLMATLALLCGRVALDE S SAT PS I P P - - 

MAHTG S TVCAFL I FAVL KNVF CQT PT S S S EVEDVI P EAN - 

MTRRRVLSVVVLLAALACRLGA Q- -TPEQ-- 

- -TSSSPTPPGSSSKAPTKPGEEASGPKSVDFYQFRVCSAS - ITGELFRFNLEQTCPDTK 

- -TTAKPTP-GPS - -TPTPP ENPPR- AEAFKFRVCSAS - ATGELFRFNLEKTCPGTE 

- - QPSDMTP - AQDAPTETPPPLSTNTNR - -GFEYFRVCGVA- ATGETFRFDLDKTCPSTQ 
NTSTSKPSTDNQGTSTPTIPTVTDDTAS -KNFYKYRVCSASSSSGELFRFDLDQTCPDTK 

TT PAS SDEN - - GKTPAIEK - - EYF K- YRVCSAS - TTGELFRFNLDRACPSTE 

TTPTTAVEK- -NKTQAIYQ- - EYF K- YRVCSAS -TTGELFRFDLDRTCPSTE 

THRPEWAEE - - NPANP FLP F - - RVCGAS PTGGEI FRFPLEESCPNTE 

THKPAVHHED - - NTTNP FLL F - - RVCGAS PTG -EI FRFPLEENC PNTE 

- -TVSDNIIRQQR- -NNTAKGIHSDPSA FPFRVCSAS -NIGDIFRFQTSHSCPNTK 

PAPPATTVQP - - TATRQ QTS FPFRVCELS - SHGDLFRFS SDIQCPS FG 

DKY-HQEGILLVYKKNIVPHIFKVRRYRKIATSVTVYRGLTES - - AITNKYELPRPVPLY 
DKT - HQEGI LMVFKKNI VPHI FKVRRYRKVATS VTVYRGWTET - - AVTGKQEVIRPVPQY 
DKK-HVEGILLVYKINIVPYIFKIRRYRKI ITQLTIWRGLTTS - - SVTGKFEMATQAHEW 
DKK-HVEGILLVLKKNIVPYIFKVRKYRKIATSVTVYRGWSQA- - AVTNRDDISRAIPYN 
DKV-HREGILLVYKKNIVPHIFKVRRYKKIATSVRIFNGWSREGVAITNKWELSRAVPKY 
DKV-HKEGILLVYKKNIVPYIFKVRRYKKITTSVRIFNGWTREGVAITNKWELSRAVPKY 
DKD-HIEGIALIYKTNIVPYVFNVRKYRKIMTSTTIYKGWSED- - AITNQHTRSYAVPLY 
DKE-HVEGILLIYKTNIVPYIFNVRKYRKLVTSTTIYKGWSQD- - AITNQYTSSFAMPLW 
D KE - HNEG I LL I FKEN I VP YVF KVRK YRK I VTT S T I YNG I Y AD - - A VTNQHVF S KS VP I Y 
TRENHTEGLLMVFKDNT I PYSFKVRSYTKIVTNILI YNGWYAD - - SVTNRHEEKFSVDS Y 

EISHMDSTYQCFSSMKVNVNGVENTFTDRDDVNTTVFLQPVEGLTDNIQRYFSQPVIYAE 
EINHMDTTYQCFSSMRVNVNGIVNTYTDRDFTNQTVFLQPVEGLTDNIQRYFSQPVLYTT 
EVGDFDSIYQCYNSATMVVNNVRQVYVDRDGVNKTVNIRPVDGLTGNIQRYFSQPTLYSE 
EISMIDRTYHCFSAMATVINGILNTYIDRDSENKSVPLQPVAGLTENINRYFSQPLIYAE 
EINLMDKNYQCHNCMQIEVNGLLNSYCDRDGNNKTVDLKPVDGLTGAITRYVSQPKIFAD 

E I D I MD KT YQCHN CMQ I E VNGMLNS Y YD RDGNN KT VD L K P VDGLT G A ITRYISQP K VF AD 
EVQMMDHYYQCFSAVQVNEGGHVNTYYDRDGWNETAFLKPADGLTSSITRYQSQPEVYAT 
EARL VD YNY E C YNG I Q VT ENGHL TT YVD RDG YNE S VRL VP ADGLT SSI RR YH S Q P E L YVT 
ETRRMDT I YQC YNS LD VTVGGNLL VYTDNDGSNMTVDLQ P VDGL S NS VRR YHS Q P E I HAE 
ETDQMDTIYQCYNAVKMTKDGLTRVYVDRDGWITVNLKPTGGLANGVRRYASQTELYDA 

PGWFPGIYRV RTTVNCB IVDMIARSAEPYNYFVTSLGDTVEVSPFCYNESSCST-TPSNK 
PGWFPGIYRV RTTVNCB IVDMIARSAEPYSYFVTALGDTVEVSPFCHNDSTCSV- AEKTE 
PGWMPGFYRVRTTVNCBIVDMVARSMDPYNYIATALGDSLELSPFQTFDNTSQS -TAPKR 
PGWFPGIYRV RTTVMCg WDMYARSVEPYTHFITALGDTIEISPFCHNNSQCTTGNSTSR 

AGWLWGTYKTRTTVNCBIVEMFARSADPYTYFVTALGDTVEVSPFCDAENSCPN AS 

PGWLWGT YRT RTTVNCB I VDMFARS AD P YT YFVT ALGDTVEVS P FCDVDNS C PN AT 

PRNLLWSYTTRTTVNCBVTEMSARSMKPFEFFVTSVGDTIEMSPFLKENGTEPE- - KILK 

PRNLLWSYTTRTTVNCEVIDMTARSHKPFEYFVTASGDSIETSPFYT-NASR R 

PGWLLGGYRR RTTVNCB VTETDARAVPPFRYFITNIGDTIEMSPFWSKAWNETEFS--GE 
PGWL I WTYRTRTTVNCL I TDMMAKSNS PFD FFVTTTGOTVEMS P F YDGKNKETF HE 
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Figure 1 (b) 
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NGLSVQVVLNHTVVTYSDRGTSPTPQNRIFVETGAYTLSWASESKTTAVCPLALWKTFPR 
NGLGARVLTNYTMVDFATR- - APTTETRVFADSGEYTVSWKAEDPKSAVCALTLWKTFPR 
AD MR VRE VKNY K FVD YNNRGT A P AGQ SRTFLETP S AT Y S WKT AT RQT AT CD L VHWKT F P R 
DATKWIEENHQTVDYERRG-HPTKDKRIFLKDEEYTISWKAEDRERAICDFVIWKTFPR 
DVLSSQVDFNHTVVDYGNRATSQQHGKRIFAHTLDYSVSWEAINKTTSVCSMVFWKGFQR 
DVLSVQIDLNHTVVDYGNRATSQQHKKRIFAHTLDYSVSWRAVNKSASVCSMVFWKSFQR 
RPHSIQLLKNYAVTKYGVGLGQADNATRFFAIFGDYSLSWKATTENSSYCDLILWKGFSN 
VP - - VQVL YNYS VTD YGVGLGSGENVTRFFATLNDFS I S WKAATENS S YCPLVL WKGFPS 
PDRTLTVAKDYRVVDYKFRGTQPQGHTRI FVDKEEYTLS WAQQFRNI S YCRWAHWKS FDN 
RADSFHVRTNYKIVDYDNRGTNPQGERRAFLDKGTYTLSWKLENR-TAYCPLQHWQTFDS 

S I QTTHEDS FHFVANE I T ATFTAP LT P VANFTDT Y S CLT SD I NTTLNAS KAKLAS TH 

A I QTTHEAS YHF VAND VT AT FT S P LSEVANFTGTYSCLDEVIQKTLNDTIKKLSDTH 

AIQTAHEHSYHFVANEVTATFNTP- - -LTEVENFTSTYSCVSDQINKTISEYIQKLNNSY 
AIQTIHNESFHFVANEVTASFLTSNQEETELRGNTEILNCMNSTINETLEETVKKFNKSH 
AIQTEHDSTYHFIANEITAGFSTS- - - KETLASFSSEYSCLMSDINSTLTDKIGRVNNTH 

AIQTEHDLTYHFIANEITAGFSTV KEPLANFTSDYNCLMTHINTTLEDKIARVNNTH 

AIQTQHNSSLHFIANDITASFSTP- -LEEEAN- FNETFKCIWNNTQEE JQKKLKEVEKTH 
AIQTKHEKSYHFIADAVTASFTTP - - LTDETS YFNTTYQCAWQDIEGEIQKRFDPVSKTH 

A I KT EHGKS LHFVAND I TAS F YT P NTQTRE VLGKHVCLNNT I ES ELKS RLAKVNDTH 

T I ATETGKS I HFVTDEGTS S FVTN TTVGIELPDAFKCIEEQVNKTMHEKYEAVQDRY 



VP - NGTVQYFHTTGGLYLVWQPMS AINLTHAQ - GDSGNPTS SPPPSAS P 

VT - NGSAQYYKTEGGLFLLWQPLT PLSLVDEMRGLNG TTPAP P 

VA-SGKTQYFKTDGNLYLIWQPLEHPEIEDID- - EpSDPEPTPAP— P- 

I R - DGEVKY YKTNGGLFL I WQAMKPLNLS EHT N- YTIER- - -N 

VP - NGTAQYFKTEGGMILVWQPLTAI ELEEAMI EATTVS PTPLS 

TP-NGTAEYYQTEGGMILVWQPLIAIELEEAMLEATTSPVTPSAP 

R P - NGT AK VY KTTGNL Y I VWQ P L I Q I D L LDT HAKL YNLTNAT AS PT S T P 

AR - NGS VQI YKTSGNL YWWQPLVQLDLLAAHAKT INSTDNSTS PTTAPN TT 

S P - NGTAQYYLTNGGLLLVWQPLVQQKLLDAKGLLDAVKKQQNTTTT T 

TKGQEAITYFITSGGLLLAWLPLTPRSLATVKNLTELTTPTSSPPSSPSPPAPSAARGST 

TTSASRRKRRSASTAAAGG GGSTDN LS YT QLQF AYDKL RDG I NQVLEELSRA 

TTSTVS RVRRS VNTNEQ ATDN LAAP QLQFATOKL RAS INKVLEELS RA 

STRRKREAADNGNSTSEVS KGSENP LITA QIQFAYDKL TTSVNNVLEELSRA 

KTGNKS RQKRS VDTKTFQG AKG LSTA QVQYAYDHL RTSMNHILEELTKT 

AHLTSRRTGRRKRDVSAG SENS VL LA Q I Q Y A YDKL RQ S I NNVL E E LA I T 

SSSRSKRAIRSIRDVSAG SENN VFLS QIQYAYPKL RQSINNVLEELAIT 

-TTSPRRRRRDTSSVSGGG NNGDNSTKEES VAAS QVQFAYDNL RKS INRVLGELSRA 

TSTSSRRKRRDTGNTATNN S S S NNS S ME ENLAT S Q VQ F A YPQ L RK S I NR VL E QL S R V 

TTTRSRRQRRS VS SGI DDV YTAEST ILLT QIQFAYDTL RAQINNVLEELSRA 

P AAVLRRRRRDAGNATT P VP PTAPGKS LGTLNNPATVQIQFAYDSLRRQI NRMLGDIJVRA 

WCREQVRDNLMVnr BLSKINPTSVMTAIYGRP VSAKFVGDAISVTECINVDQSSVNIHKSL 
WCREQVRDTYMWY BLSKINPTSVMTAIYGRP VSAKFVGDAISVTDCVAVDQASVSIHKSL 
WCREOVRDTLMWY BLSKVNPTSVM3AIYGKP VAARYVGDAI SVTDCI YVDQS S VNI HQS L 
WCREOKKDNLMWY BLSKINPVSVMAAIYGKP VAV1CAMGDAFMVSECINVDQASVNIHKSM 
WCREOVRQTMIWY BIAKINPTSVMTAIYGKP VSAKALGDVISVTECINVDQTSVSIHKSL 
WCREQVRQTMVWY BIAKINPTSVMTAIYQKP VSRKALGDVISVTECINVDQSSVSIHKSL 
WCREOYRASLMWY BLSKINPTSVMSAIYGRP VSAKLIGDWSVSDCISVDQKSVFVHKNM 
WCONOYRASLMWY BL3KINPTSVMSAIYGRP VSAKLVGDWQISDCITVDQESVFVHRNL 
WCREQHRASLMWN BLSKIKPT3VMSSIYQRP VSAKRIGDVISVSHCVWDQDSVSLHRSM 
WCLEQKRQNMVLR BLTKINPTTVMSS I YGKA VAAKRLGDVI SVSQCVPVNQATVTLRKSM 
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Figure 1(c) 
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RTN SKDVCYARPLVTFKFLNSSNLFTGQLGARNEIILTNNQVETCKDTCEHYFITRN 

RTS TPGMCYS RP PVTFRFLNSTTLFKGQLGPRNE 1 1 LTDNQVEACKETCEHYF I ASN 

RLQH- -DKTTCYSRPRVTFKFINSTDPLTGQLGPRKEI ILSNTNIETCKDESEHYFIVGE 

RTD DPKVCYSRPLVTFKFVNSTATFRGQLGTRNEILLTNTHVETCRPTADHYFFVKN 

KTT- - -NNDVCYSRPPVTFKFV^^SQLFKGQLGARNEILLSESLVENCHQNAEHFFTAKN 

KTE NNDICYSRPPVTFKFVNSSQLFKGQLGARNEILLSESLVENCHQNAETFFTAKN 

KVPG- - KEDLCYTRPVVGFKFINGSELFAGQLGPRNEIVLSTSQVEVCQHSCEHYFQAGN 
RVPG- - SKDLCYTRPVVGFKFINGSELFVGQLGARNEILLSTNLVEVCQHSCEHYFQGGN 
RVPGRDKTHECYSRPPVTFKFINDSHLYKGQLGVNNEILLTTTAVEICHENTEHYFQGGN 
RVPG - -SETMCYSRPLVSFSFINDTKTYEGQLGTDNEIFLTKKMTEVCQATSQYYFQSGN 

ETLVYKDYAYIjRTINTTDISTLNTFIALNLSFIQNIDFI^ 

VTYYYKDYVFVKKINTSEISTLGTFIALNLSFIENIDFRVIELYSRAEKKLSGSVTOIBT 
YIYYYraTYIFEEKLNLSSIATLDTFIALNISFIENIDFKTVELYSSTERKLASSVFDIM 
MTHYFKDYKFVKTMDTNNISTLDTFLTLNLTFIDNIDFKTVELYSETERKMAS -ALDLET 
ETYHFKNYLHVETLPLTNISTLDTFIJVLNLTFIENlbFKAVELYS SGERKLAN - V FDLBT 
ETYHFKNYVHVETLPVNNI STLDTFLALNLTFI ENIDFKAVELYSSGERKLAN - VFDLBT 
QMYKYKDYY YVSTLNLTD I PTLHTMITLNLSLVENI DFKVI ELYS KTEKRLSN - VFDIET 
HIYKYKNYEYVSTMNLTDVPTLHTMITLNLSLVENVDFQVIQLYSQKEKKLSN-VTOIM 
NMYFYKNYRHVKTMPVGDVATLDTFMVLNLTLVENIDFQVIELYSREEKRMST-AFDIET 
EIHVYNDYHHFKTIELDGIATLQTFISLNTSLIENIDFASLELYSRDEQRASN- VFDLEO 

MFRBYNYY THRLAGLREDLDNT I DMNKERFVRDLS E I VADLGG I GKTWNVAS SWTLCG 
MFRBYWYYT QRLAGLREDLDNT I DLNRDRLARDLS E I VADLGD VGRTWNVAS S VI TL FG 
MTRgYHYYT YSLAGIKKDLDNTIDYNRDRLVQDLSDMMADLGDIGRSVVNVVSSVVTFFS 
MFRBYNYY TQKLASLREDLDNT I DLNRDRL VKDL S EMMADLGD I GKVWNT F SG I VTVFG 
MFRBYNYY AQSISGLRKDFDNSQRNNRDRIIQDFSEILADLGSIGKVIVNIASSAFSLFG 
MFRBYKYYA QSISGLRKDFDNSQRNNRDRI IQDFSEILADLGSIGKVIVNVASGAFSLFG 
MFRBYNYYT QNLNGLRKDLDD S I DHGRD S F I QTLGD I MQDLGT I GKVWNVAS G VF S L FG 
MFRBYMYY TQNLKGLRKDLDDSIHDGRDSFIQFLGDLVQDLVPVGDVIVNVASGVFSLFG 
^RBYNYYTQRVTGLRRDLTD - LATNRNQFVDAFGS LMDDLGWGKTVLNAVSSVATLFS 
IFREYNFQAQNIAGLRKDLDNAVSNGRNQFVDGLGELMDSLGSVGQSITNLVSTVGGLFS 
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SLVTGFINFIKHPLGGMLMI 1 1 VIAI ILI I FMLSRRTNTIAQAPVKMI YP DVDRRA 

SIVSGFINFIKSPFGGMLMILVIVAWLIVFALNRRTNAIAQAPIRMIYP DIDKMQ 

SIVTGFIKFFTNPLGGIFILLIIGGI IFLVVVLNRRNSQFHDAPIKMLYPSVENYAARQA 

SIVGGFVSFFTNPIGGVTI ILLLIVWFWFIVSRRTNNMNEAPIKMI YP NIDKAS 

GIVTGILNFIKNPLGGMLTFLLVGAI I ILVILLVRRTNNMSQAPIRMI YP DIEKSR 

GIVTGILNFIKNPLGGMFTFLLIGAVIILVILLVRRTNNMSQAPIRMIYP DVEKSK 

SIVSGVISFFKNPFGGMLLIVLIIAGVVVVYLFMTRSRSIYSAPIRMLYP GVERAA 

SIVSGVISFLKNPLGAILTIALIVGGI IVLYLFITRSRTVYQAPIRMLYP EVDRAP 

S I VS G 1 1 NF I KN P FGGMLL FGL I AAWI TVI L LNRKAKR F AQN P VQM I Y P DIKTIT 

SLVSGFISFFKNPFGGMLILVLVAGWILVISLTRRTRQMSQQPVQMLYP GIDELA 
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pp SGGAPTREE I KNI LLGMHQLQQ ERQKADDLKKSTPS VFQRTANGLR 

p SGGKVDQEQIKNILAGMHQLQQ EERRRLDEQQRSAPSLFRRASDGLK 

PPPYSA S P PA I DKE E I KR I L LGMHQ VHQ EEKEAQKQLTNSGPTLWQKATGFLR 

EQE NI QPLPGEE I KRI LLGMHQLQQ S EHGKS EEEASHKPGLFQLLGDGLQ 

S SVTPTEPEVIKQILLGMHNMQQ EEYKKREEHKASQPSFLKRATDAFL 

S TVTPMEPETIKQILLGMHNMQQ EAYKKKEEQRAARPSIFRQAAETFL 

QEP GAHPVSEDQIRNILMGMHQFQQRQRAEEEARREEEVKGKRTLFEVIRDSAT 

QQ NVQPI PEDQVRS I LLAMHQFQQQQQQQQQQQQEEHTQ - RRS I FDTIRESTS 

SQREEL QVDPISKHELDRIMLAMHDYHASK- -QPESKQDEEQGSTTSGPADWLNKAK 

QQHASGEG PGINP I S KTELQAI MLALHEQNQ EQKRAAQRAAGPSVASRALQAAR 
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Figure 2 



ATGG CAGGT A GCTTAAAACT TAGGGGATCT GTTCTAGCAC TGTGGTACCT GTATCAGGTG 6 0 
GCTCTTTATT CACTTAGTAT AGCAGAGACC GGTGTAACCT CACCTCCAAA TACAGCGACC 12 0 
TGGT CTACTG AATCGCCGCT AACAGGTCAC TATGGAACAC ACGATT CAAG CCATGGTGAA 180 
AGAGGAAACA ACGAAAACAG AGATT CAGAA GAGCAAAATA AAAACATTTA TGGATCGCCT 24 0 
TCTACGTTTC CTTACAGAGT ATGCAGTGCC TCCGGAGTTG GAGATGTCTT TAGATTTCAG 3 00 
ACCGACCATG TGTGTCCCGA TGCCAGTGAT ATGGTACACA GTGAGGGGAT TCTACTAATT 360 
TACAAACAGA ACATTATTCC ATTTATGTTT AGAGTTAGGA AATATAGAAA AGTTGTTACA 420 
ACAAGTACTG TCTACAATGG T ATTTATT CT GACTCTATTA CCAACCAACA TACTTTCTAT 480 
AAATCAATCG AACCTTGGGA GACAGAAAAG ATGGACACAA TAT AT CAGTG TTTTAATTCT 54 0 
TTAAGACTAA ACACAGGTGG AAATCTGCTT ACTTATGTAG ATAGAGATGA TATAAATATG 600 
ACAGTGTTTC TGCAACCTGT TGACGGTGTG ACGCCCGATG TGAAGAGGTA TGGCAGTCAA 66 0 
CCAGAGCTGT ACCTTGAACC TGGCTGGTTT TGGGGTAGTT ATAGAAGACG AACTACAGTG 72 0 
AACTGTGAAC TAATGGACAT GTTTGCAAGA TCAAATCCTC CATTTGATTT CTTTGTTACA 78 0 
GCTACAGGTG ATACGGTGGA AATGTCTCCA TTTTGGAGTG GTGAAGATGA TCATGAAAAT 84 0 
AAGATGCACG AGAAGCCATG GTTTGTTAGT GTGATAAATA ACTACAAGGT GGTGGACTAT 9 00 
CAAAACAGAG GGACTGTACC CCTTGGAAAA ACAAGGATAT TT CT AGAT AG GGAAGAGTAT 96 0 
= ACATTATCTT GGGAAAAGCA TCTAAAAAAT ATGTCATATT GTCCACTAAC ATTATGGAAA 102 0 
%l GCATTTTACA ATGGAATCCA GACGGAGCAT TCAGGCTCAT ATCATTTTGT AGCCAATGAC 108 0 
if ATCACAGCGT CATTCACAAC TAGTAAAGAA GACATGAAAG AGTTCAATAC GACAT AT CAT 114 0 
hi TGTCTCAACG AGGAAATAAA GGCAGAAATA GAGAAGAAAT ATGCAAAAGT AAACT CAACT 12 0 0 
yl CACTCTAAAT ATGGAGATCT GAAATACTTT AAAACAGATG GGGGTCTCTA TTTAGTCTGG 1260 
^ CAACCTCTTA TTCAAAACAG GCTTCTTGAT GCTAAGAACA AACTGAACAA TGAGACTTAT 13 2 0 
U TCCAGGAGAT CACGACGTCA GGCAGAATCT ACTACTGACC CAATGATGGA GATGACTGGA 13 8 0 
01 AATGGAGCAG GTGGAGAATA TAGCAGTGAA AATTCAATCA CGGTGGCGCA GGTGCAGTAT 14 4 0 
a p GCCTATGACA ATCTTCGTAT CAGAATAAAT AACATTTTGG AAGATTTGTC AAAGGCATGG 15 0 0 
* TGTCGTGAGC AG CAT AGAGC TGCTCTGGTG TGGAATGAGC TCAGCAAGAT TAATCCCACA 156 0 
AGCGTCATGA GCATGATTTA CAATAGAC C C GTATCAGCCA AAAGAATAGG AGATGT CATT 162 0 
TCAGTCTCTA ACTGTATTGT GGTAGACCAA ACCAGTGTCT CATTACATAA AAGTCTCAGG 16 8 0 
CTTCT CAGTG CAT CGGATGA AAAGTGCTTC T CT AG AC CT C CAGTGACATT TAAGTTTATG 174 0 
l M AATGACAGTA CTATTTACAA AGGGCAACTA GGAGT CAAT A ATGAGATTCT CTTAACCACA 18 00 
— ACATACCTTG AAACATGTCA GGAAAACACT GAGTATTACT TTCAGGCAAA GACAGACATG 18 60 
C3 TACATTTACA AAAACTATGA GCATTTGAAG ACTGTGCCTT TATCTTCGAT CACCACACTA 192 0 
HJ GATACATTTA TAGCCCTTAA TTTTACACTA TTGGAGAATG TTGACTTTAA AGT CATTGAA 1980 
CTTTATACCA GGGACGAGAA GAGGCTTAGT AATGTCTTTG ACATTGAAAC AATGTTTAGG 2 04 0 
GAATATAACT ACTATGCTCA GAGGGT CAGT GGCCTCAGAA AGGATTTGCT GGATCTAAGC 2100 
AC CAAT AGAA ATCAATTTGT GGATGCATTT GGTAGTCTTA TGGATGATTT GGGTGCTGTT 216 0 
GGGCAGACAG TTGTAAATGC TGTAAGTGGT GTGGCTACGC TGTTTAGCTC AATTGTAACA 22 2 0 
GGATTTATTA ATTTCATTAA AAACCCATTT GGTGGAATGT TAATGATTAT TGTTGTTATT 22 8 0 
GGTGTGCTAT TTGCCATCTA CTTTCTGACC AAAAAGACGA AGATATATGA GACGGCACCG 23 4 0 
ATTAAGATGA TTTATCCTGA AATTGACAAG CTGAAAGAAC GTGAGGGAAA AT GAG AAAT A 24 0 0 
GCACCAATCA GTGAAGAAGA GCTGGAGAGA ATTGTACTTG CTATGCACAT CCATCAACAA 24 6 0 
AATT GACAT A TGGAAACAAA AACAAGGAAG GATCCCAAAG AC AG CAT ATT AACAAGGGCA 2 52 0 
CAAAATATGC TACGCAAAAG ATCAGGATAT TCTAATTTAA AAAATGCTGA ATCTGTGGAG 2 58 0 
ATGTTAAACA CTTTATAA 2598 
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Figure 3 



MAGSLKLRGS VLALWYLYQV ALYSLSIAET GVTSPPNTAT WSTES PLTGH 50 

YGTHDS SHGE RGNNENRD S E EQNKNIYGSP STFPYRVCSA SGVGDVFRFQ 100 

TDHVCPDASD MVHSEGILLI YKQNIIPFMF RVRKYRKWT TSTVYNGIYS 150 

DSITNQHTFY KSIEPWETEK MDTIYQCFNS LRLNTGGNLL TYVDRDDINM 200 

TVFLQPVDGV TPDVKRYGSQ PELYLEPGWF WGSYRRRTTV NCELMDMFAR 250 

SNPPFDFFVT ATGDTVEMSP FWSGEDDHEN KMHEKPWFVS VINNYKWDY 3 00 

QNRGTVPLGK TRIFLDREEY TLSWEKHLKN MSYCPLTLWK AFYNGIQTEH 3 50 

SGSYHFVAND ITASFTTSKE DMKEFNTTYH CLNEEIKAEI E KKY AKVNS T 4 00 

HSKYGDLKYF KTDGGLYLVW QPLIQNRLLD AKNKLNNETY SRRSRRQAES 4 50 

TTDPMMEMTG NGAGGEYSSE NSITVAQVQY AYDNLRIRIN NILEDLSKAW 500 

CREQHRAALV WNELSKINPT SVMSMIYNRP VSAKRIGDVI SVSNCIWDQ 550 

TSVSLHKSLR LLSASDEKCF SRPPVTFKFM NDSTIYKGQL GVNNEILLTT 6 00 

TYLETCQENT EYYFQAKTDM YIYKNYEHLK TVPLSSITTL DTFIALNFTL 6 50 

LENVDFKVIE LYTRDEKRLS NVFDIETMFR EYNYYAQRVS GLRKDLLDLS 700 

TNRNQFVDAF GSLMDDLGAV GQTWNAVSG VATLFSSIVT GFINFIKNPF 7 50 

GGMLMIIWI GVLFAIYFLT KKTKIYETAP IKMIYPEIDK LKEREGKSEI 800 

APISEEELER IVLAMHIHQQ NSHMETKTRK DPKDS I LTRA QNMLRKRSGY 850 

SNLKNAESVE MLNTL 865 
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pGHV-gpB DNA.txt 
pGHVl DNA . 641-1300) 



_._ 1 1- AATCT TCGTATCAGA ATAAATAACA 

CGCCGCCGTC CGGCTCCACG GTGGTGCGGC TGGAGCCCGA GCAGGC--CT 

T G. C GA A . C . 



25 
688 



pGHV-gpB DNA.txt 
pGHVl DNA . (641-1300) 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



'™Z pGHV-gpB DNA.txt 

pGHVl DNA. (641-1300) 

Ul 

^; pGHV-gpB DNA.txt 

UJ pGHVl DNA. (641-1300) 

yl 

i s: pGHV-gpB DNA.txt 

k pGHVl DNA. (641-13 00) 



pGHV-gpB DNA.txt 
f|j pGHVl DNA. (641-1300) 



fl pGHV-gpB DNA.txt 

f[\ pGHVl DNA. (641-1300) 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



TTTTGGAAGA TTTGTCAAAG GCATGGTGTC GTGAGCAGCA TAGAGCTGCT 
GCCCCGAGTA CTCG-CAGGG G CGCAACTT C ACGGAGGGGA TCGCCGTGCT 
GA A T G CA G GC TC G . G A T G TGCT 

CTGGTGTGGA ATGAGCTCAG CAAGATTAAT CCCACAAGCG TCATGAGCAT 

CT T-CA AGGAGAACAT C--G-CC--C CGCACAAGT- TCAAGGCCCA 

CT T A A GAG CA C G C CACAAG TCA G C 

GATTTACAAT -AGACCCGTA TCAGC-CAAA AGAATAG-GA GATGTCATTT 
CATCTACTAC AAGAACGTCA TCGTCACGAC CGTGTGGTCC GGGAGCACGT 
AT TAC A AGA C ATCCCA GTG G CAT 

CAGTCTCTAA C - TGTATTG - - TGGTAGACC AAAC CAGTGT CTCATTACAT 
ACGCGGCCAT CACGAACCGC TTCACAGACC GCGTGCCCGT CCCCGTGCAG 
G CACGAG T AGACC GT C C T CA 

AAAAGTCTCA GGCTTCTCAG TGCATCGGAT GAAAAGTGCT T CTCT AGACC 
GAGA - TCACG GACGTGATCG ACCGCCG- -C GGCAAGTGCG TCTCCA- AGG 
AATCC CT G CCG G AAGTGC TCTC A A 

TCCAGTGACA T--TTAA-GT TTATGA-ATG ACAGTACT-A TTTACAAAGG 
CCGAGT - ACG TGCGCAACAA CCACAAGGTG ACCGCCTTCG ACCGCGACGA 
C AGT AC T AA A A TG AC G T C AG 

GCAACTAG- - GAGTCAATAA TGAGATTCT CTTAAC ---CACAACA 

GAACCCCGTC G AGGTGG AC C TGCGCCCCTC GCGCCTGAAC GCGCTCGGCA 
G A C GAG TG G CT CT AAC C C 

TAC-C--TTG AAACA- TGTC -AGG AAA ACACTGAGTA TTAC-TTTCA 

CCCGCGGCTG GCACACCACC AACG AC AC CT ACACCAAGAT CGGCGCCGCG 
C C TG ACA C A GA A ACAC AG C C 

GGCAAAGACA GACATGTACA TTTACAAAAA CT- -AT GAGCATTTGA 

GGCTTCTAC - GACACGGGCA CCTCCGTCAA CTGCATCGTC GAGGAGGTGG 
GGC AC ACA G CA T C AA CT AT GAG A TG 

AG AC - - TGTGCCTT TA TCT TCGATCACCA CACTAGATAC 

AGGCGCGCTC CGTGTACCCC T ACG ACT C CT TCGCCCTGTC CACGGGGGAC 
AG C TGT CC TA CT TCG C CAC G AC 

ATT TATA GCCCTTAATT TTAC - - ACTA TTGGAGAATG TTGACTTTAA 

ATTGTGTACA TGTCCCCCTT CTACGGCCTG CGCGAGGGGG CCCACGGGGA 
ATT TA A C TT TAC T GAG G AC A 



75 
737 



125 
776 



172 
826 



219 
876 



269 
922 



314 
971 



354 
1021 



395 
1071 



439 
1120 



476 
1170 



521 
1220 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-1300) 



AGT CATTG AA CTTT AT AC CA GGG ACG AG - AAGAGGC TTAGTA- -AT 564 

GCACATCG-G CTACGCGCCC GGGCGCTTCC AGCAGGTGGA GCACTACTAC 126 9 

CAT G CT CC GGG C AG G GG A TA A 



pGHV-gpB DNA.txt 
pGHVl DNA. (641-13 00) 



GTCTTTGACA TTGAAACAAT G -- 585 

CCCATCGAC- CTGGACTCGC GCCTCCGCGC CT 13 00 
C T GAC TG A G 
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pGHV-gpB prot 
PGHVlProt. (491-850) 



-■ N LRI 4 

PAAPAAARRA RRSPGPAGTP EPPAVNGTGH LRITTGSAEF ARLQFTYDHI 54 0 

LRI 



pGHV-gpB prot 
PGHVlProt. (491-850) 



pGHV-gpB prot 
PGHVlProt. (491-850) 



pGHV-gpB prot 
PGHVlProt. (491-850) 



pGHV-gpB prot 
Lfe PGHVlProt . (491-850) 



LJ pGHV-gpB prot 

\jk PGHVlProt . (491-850) 

U1 

Ij \ pGHV-gpB prot 

« PGHVlProt. (491-850) 



- -R INN I LED LSKAWCREQH RAALVWNELS KINPTSVMSM IYNRPVSAKR 
QAHVNDMLGR IAAAWCELQN KDRTLWS EMS RLNPSAVATA ALGQRV S ARM 
N L AWC Q WES NP V VSA 



IGDVISVSNC IWDQTSVSL HKSLRLLSAS DEKCFSRPPV TFKFMNDSTI 
LGDVMAISRC VEV-RGGVYV QNSMR-VPGE RGTCYSRPLV TFE - HNGTGV 
GDV S C V V S R C SRP V TF N 

YKGQLGVNNE ILLTTTYLET CQENTEYYFQ AKTDMYIYKN YEHLKTVPLS 
IEGQLGDDNE LLISRDLIEP CTGNHRRYFK LGSGYVYYED YNYVRMVEVP 
GQLG NEL ECNYF YY V 

SITTLDTFIA LNFTLLENVD FKVIELYTRD E KR 

- -ETISTRVT LNLTLLEDRE FLPLEVYTRE ELADTGLLDY SEIQRRNQLH 
T LiN TLLE F E YTR E R 



ALKFYD I DRV VTCVDHNWLL RGIANFFQGL GDVGAAVGKV VLGATGAVIS 



LS NVF 

AVGGMVSFLS NPFGALAIGL LVLAG LV AAF LAYRHISRLR RNPMKALYPV 
LS N F 



52 
590 



102 
637 



152 
687 



185 
735 



185 
785 



190 
835 



s pGHV-gpB prot 

I.. PGHVlProt. (491-850) 



DI E--TM 

TTKTLKEDGV DEGDV 



195 
850 
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pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



C CAG CAT AAT GATAGCCAAT AAT CTGTGTT ACTCTACCCT GATCTTAAAT 



50 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



GACGAGGACG TGACGGGGAT CGACGAGAAA GAT ATT CTG A 



AATCT 

CGGTGCATGT 
AT T 



5 

100 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



f'l 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



P 



as 
i i 

ru 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



pGHV-gpB DNA.txt 
pGHV2 DNA.txt 



TCGTATCAGA 
- - AAACAAGA 
A AGA 

TC - GTGAGCA 
TCTATACTCG 
TC T C 

AATCCCACAA 

AA GGAA 

AA AA 

CAAAAGAATA 
C-ACTG-AT- 
C A G AT 

AAAC CAGTGT 
- GTGCAATGC 
CA TG 

GAAAAGTGCT 
CTGCTGC-CG 
G C 

TACTATTTAC 
GGC - - GGGAC 
C AC 

CAACATACCT 
CAATGT-CCT 
CAA T CCT 



ATA- AAT -AA CATTTTGGAA GATTTGTCAA 
ATACCGTGTA CAGGTTCG-T TAGGAG - CAG 
ATA T A CA TT G A G CA 

GCATAGAGCT GCT CTGGTGT GGAATGAGCT 

GCAC GCT GCT GTCT AG - ATG - GCT 

GCA GCT GCT GT T G ATG GCT 

GCG - TCATGA GCATGATTTA C AAT - AG AC C 
GTGAAGGCGC GCATGAAACG CTGTGAGGAC 
G G G GCATGA C T AG C 



GGAGATGTCA 
- -ACTTGACA 
A TG CA 

CTCATTACAT 
GTT-TTAC- - 
T TTAC 

TCTCTAGACC 
TGTCT- -CCC 
T TCT CC 

AAAGGGCAAC 

A TGC--T 

A GC 

TGAAACATGT 

T TCGT 

T GT 



TTTCAGTCTC 
- AGCAGCAGC 
CAG C 

AAAAGTCTCA 

GGCTTCA 

G TCA 

TC CAGTGACA 
TCTAGCGGCG 
TC AG G C 

TAGGAGTCAA 
TAGG---CA- 
TAGG CA 

CAGGAAAACA 
CTAGAGAATA 
C GA AA A 



TAACTGTATT 
TTGC - - CCTC 
T C T 

GGCTTCTCAG 
CGGGAGCC - G 
G C G 

TTTAAGTTTA 

TCCA TCA 

T A T A 

TAATGAGATT 
-GACGAG-TG 
A GAG T 

CTGAGTATTA 

CG 

C 



AGGCATGGTG 
CGTCAGGGAG 
G CA GG G 

CAGCAAGATT 
CAGGAAGAGA 
CAG AAGA 

CGTAT-CAGC 
CCTATGTTGG 
C TAT T 

GTGGTAGACC 
AAGGT-GAC- 
GGT GAC 

TGCATCGGAT 
TGCA-CGG-T 
TGCA CGG T 

TGAATGACAG 
CCAGC - AT AG 
A A AG 

CTCTTAACCA 
ACTTT AT CAA 
TTA C A 

CTTTCAGGCA 



AAGACAGACA TGTACATTTA CAAAAACTAT GAGCATTTGA AGACTGTGCC 



TTTATCTTCG 


ATCACCACAC 


TAGATACATT 


TATAGCCCTT 


AATTTTACAC 


TATTGGAGAA 


TGTTGACTTT 


AAAGT CATTG 


AACTTTATAC 


CAGGGACGAG 



53 
146 



102 
188 



14 9 
234 



199 
274 



. 249 
314 



299 
357 



349 
394 



399 
420 



449 
420 



499 
420 



549 

■ 420 



pGHV-gpB DNA.txt AAGAGGCTTA GTAATGTCTT TGACATTGAA ACAATG 585 
pGHV2 DNA.txt " " 420 
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Figure 7 



pGHV-gpB prot NLRIRINNIL EDLSKAWCRE QHRAALVWNE LSKINPTSVM SMIYURPVSA 50 

P GHV2 prot.txt S--IMIANNL C YSTLI - - - LNDEDVTG-- * IDE 25 

ii n l ; _ c ; " ' L L T 

pGHV-gpB prot KRIGDVISVS NCIWDQTSV SLHKSLRLLS ASDEKCFSRP PVTFKFMNDS 100 

PGHV2 prot.txt K DILTVH VNKNTV, 7 ■ " "" _YRFVRSS 45 

K D V . V . V ' F S 

pGHV-gpB prot TTYKGQL GV-NNBILiLT" TTYLBTCQEN' TEYYFQAKTD ' MYI -* - - YKN- 142 

pGHV2 prot.txt VRESILGTLL SRWLRKRKEV KARMKRCEDP MLALILDKQQ LALKVTCNAF 95 



pGHV-gpB prot YEHLKTVP-- LSSITTLDTF IALNFTLL-E NVDFKVIELY TRD---EK-R 185 
DGHV2 prot.txt YGFTGAVHGL LPCLPLAASI TSIGRDMLRQ TSDFINNVLS SREYVSEKFS 145 
Y V L LDFLREK 



u 

yi 
y i 



ru 

U 

5 



pGHV-gpB prot LSNV-F- -DI ETM- 195 

pGHV2 prot.txt LSDGDFQGDF SPEC 159 

LS F D 
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Figure 8 



pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 



DNA 

DNA. txt 



DNA 

DNA . txt 



DNA 

DNA. txt 



DNA 

DNA . txt 



DNA 

DNA. txt 



AAT CTTCGTA 


T CAG AAT AAA 


TAACATTTTG 


GAAGATTTGT 


CAAAGGCATG 


50 


GTGTCGTGAG 


CAGCATAGAG 


CTGCTCTGGT 


GTGGAATGAG 


CTCAGCAAGA 


100 


TTAATCCCAC 


AAGCGTCATG 


AGCATGATTT 


ACAATAGACC 


CGTATCAGCC 


150 


AAAAGAATAG 


GAGATGTCAT 


TTCAGTCTCT 


AACTGTATTG 


TGGTAGACCA 


200 


AACCAGTGTC 


TCATTACATA 


AAAGTCTCAG 


GCTTCTCAGT 


GCATCGGATG 


250 



pGHV-gpB 
AF118399 



DNA 

DNA. txt 



AAAAGTGCTT 



CTCTAGAC CT CCAGTGACAT TTAAGTTTAT GAATGACAGT 300 

TAAT CTATGTCACT 14 

T AT ATG CA T 



pGHV-gpB DNA 
AF118 3 99 DNA. txt 



ACTATTTACA AAGGGCAACT AGGA-GTCAA TAATGAGATT CTCTTAACCA 34 9 

-CTACCC-TA ATCCATCATG AAGACCTGCA TAAATATCCT CAATTAAAGG 62 
CTA A A A A GA T A TAA A T A TTAA 



Z2 pGHV-gpB DNA 

% AF1183 99 DNA. txt 



CAACATACCT TGAAACATGT CAGGAAAACA CTGAGTATTA 

AGGAGGATTA TGAAACAT - - 

A TGAAACAT 



CTTTCAGGCA 
- TTT 



399 
83 



pGHV-gpB DNA 
AF118399 DNA. txt 



AAGACAGACA TGTACATTTA CAAAAACTAT GAGCATTTGA AGACTGTGCC 44 9 

TG ATT - - -AG TT- CTG 95 

TG ATT AG TT CTG 



pGHV-gpB DNA 
AF118399 DNA. txt 



TTTATCTTCG 



ATCACCACAC TAGATACATT TATAGCCCTT AATTTTACAC 4 99 

GTCC- 99 

GTCC 



^: pGHV-gpB DNA 

I \J AF1183 99 DNA. txt 

f* x t 

J~ pGHV-gpB DNA 

HJ AF118399 DNA. txt 



TATTGGAGAA TGTTGACTTT AAAG TCAT-T--GA A CTT 

TGTTCACTTT GTAAAAAAAC ACATATCAGA ATCTCTTCTG 

TGTT ACTTT AAA CAT T GA A CTT 

- -TA -TAC- -CA- - G- -GG- - ACG AGA --AG--AGG- 

TCTAACCTGC TTACAACATG GCTGGCTAAG AGAAAAATGA TCAGAAAGGA 
TA TAC CA G GG A G AGA AG AGG 



534 
139 



555 
189 



pGHV-gpB DNA 
AF118399 DNA. txt 



CTTAGTA- - A TGT-CT- - TT GACA-TTGA- AACAATG 585 

ATT AG CAGCA TGTGCTGACC CAAAGCTCAG GACAAT - TTT AGATAAACAG 23 8 

TTAG A A TGT CT A A T A ACAAT 



pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 

pGHV-gpB 
AF118399 



DNA 

DNA. txt 



DNA 

DNA . txt 



DNA 

DNA . txt 



DNA 

DNA . txt 



DNA 

DNA. txt 



CAGCTTGCAA 


TTAAGGTGAC 


ATG CAATGCT 


GTGTATGGGT 


T CACTGGTGT 


TGCATCTGGT 


ATGCTGCCCT 


GTCTCAAGAT 


TGCAGAGACC 


ATAACTATGC 


AAGGAAGGGC 


CATGTTGGAA 


AAGACAAAAG 


TATTTGTAGA 


GAATTTAAGT 


CATGAGG AT C 


TCCATTCCAT 


CTGTAAGGTT 


GGCTTTATGC 


CTCAGTCACC 








585 
466 




AAACAGCATT 


GATAAACCCT 


TCAAGGTG 





585 
288 



585 
338 



585 
388 



585 
438 
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Figure 9 



pGHV-gpB DNA 
AF118401 DNA.txt 



GAGGACCTGC ATAAGTATCC TCAATTAAAG GAGGATGATT ATGAAACATT 



50 



pGHV-gpB DNA 
AF1184 01 DNA.txt 



AATCTTC GT AT CAGAAT 17 

TTTGATTAGT TCTGGCCCTG TTCACTTTGT AAAAAAACAC ATATCAGAAT 100 

AA C TATCAGAAT 



pGHV-gpB DNA 
AF118401 DNA.txt 



pGHV-gpB DNA 
AF1184 01 DNA.txt 



pGHV-gpB DNA 
AF118401 DNA.txt 



pGHV-gpB DNA 
L.3 AF1184 01 DNA.txt 



pGHV-gpB DNA 
[j1 AF118401 DNA.txt 

U 

Ql pGHV-gpB DNA 

J~ AF118401 DNA.txt 



Q pGHV-gpB DNA 

^ AF118401 DNA.txt 



fjj pGHV-gpB DNA 

JL AF118401 DNA.txt 



S M 



pGHV-gpB DNA 
AF118401 DNA.txt 



pGHV-gpB DNA 
AF1184 01 DNA.txt 



pGHV-gpB DNA 
AF118401 DNA.txt 



AAATAACATT TTGGAAGATT TGTCAAAGGC ATGGTGTCGT GAGCAGCATA 

C-TC TT 

C T TT 

GAGCTGCTCT GGTGTGGAAT GAGCTCAGCA AGATTAATCC CACAAGCGTC 

CTG-TC- GAA- CTT -G CT CACAA C 

CTG TC GAA CT G C CACAA C 

ATGAGCATGA TTTACAATAG ACCCGTATCA GCCAAAAGAA T- - - 

ATG-GC-TG- GCCAAGAGAA AAATGATCAG 

ATG GC TG T 

- - AGG - AG - ATGT - CA - -TTT 

AAAGGAATTG ACAGCATGTG CTGATCCAAA GCTCAGGACA ATTTTAGATA 
AGG AG ATGT CA TTT 

CAGTC T CTA- AC - TGT A TTG-TG-GTA - -GA-CCA- - 

AACAGCAGCT TGCAATTAAG GTGACATGCA ATGCTGTGTA TGGATTCACT 
CAG T TA AC TG A TG TG GTA GA CA 

A- AC-CA G TGTCTC A TTAC 

GGTGTTGCAT CTGGTATGCT GCCATGTCTC AAGATTGCAG AGACCATCAC 
A A C G TGTCTC A TCAC 

AT AAAAGT - - CT -CAG-GCTTC 

TATGCAAGGA AGGGCCATGT TGGAAAAGAC AAAAGT ATTT GT AGAGAAT C 

A AAAAGT T AG G TC 



TCAG TGC A TCGGA 

TGAGTCATGA AGATCTCCGT 
T AG TG A TC G 

GTGACATTTA AGTTTATGAA 

GT--CA-TCA A ACG-- 

GT CA T A A AG 

AGTCAATAAT GAG ATT CT CT 

- G AT AAA 

G ATAA 



T - G AAAAGT - -GCTT--CTC TAGACCTCCA 
TCCATATGTA AGGTTGGCTC TATACCTC-A 
T A A GT G TT CTC TA ACCTC A 

TGACAGTACT ATTTACAAAG GGCAACTAGG 

TG -TTT 

TG TTT 

TAACCACAAC AT AC CTTG AA ACATGTCAGG 



AAAACACTGA GTATTACTTT CAGGCAAAGA CAGACATGTA CATTTACAAA 



67 
105 



117 
125 



158 
152 



172 
202 



200 
252 



217 
302 



235 
352 



273 
401 



323 
417 



373 
423 



423 
423 



pGHV-gpB DNA 
AF1184 01 DNA.txt 

pGHV-gpB DNA 
AF118401 DNA.txt 

pGHV-gpB DNA 
AF118401 DNA.txt 

pGHV-gpB DNA 
AF118401 DNA.txt 



AACTATGAGC ATTTGAAGAC TGTGCCTTTA TCTTCGATCA CCACACTAGA 

TACATTTATA GCCCTTAATT TTACACTATT GGAGAATGTT GACTTTAAAG 

TCATTGAACT TTATACCAGG GACGAGAAGA GGCTTAGTAA TGT CTTTG AC 

ATTGAAACAA TG 585 
423 



473 
423 



523 
423 



573 
423 



11/14 



o 



o 



Figure 10 



Query: 


1970 


Sbjct : 


18669 


Query: 


2030 


Sbjct: 


18729 


Query: 


2090 


Sbjct : 


18789 


Query: 


2150 


Sbj ct : 


18849 


Query : 


2210 


H Sbjct: 


18909 



Mil II Mill II I II II I I 1 1 I 1 I I I I II I II Mill II II I 



I Mill Mill I II I I I II I i I UN MMM I I III III 



II III II II 1 1 II 1 1 1 1 1 1 I I M Mill Mill II II Mill II I 



III I II III Ml II I MINIM I 



I I 1 1 I I II II II III 



caattgtaacaggatttattaatttcattaaaaacccatttggtggaatgtt 22 61 

I II II INI I II lllllllllllllllll INN llllllll 



IF! Score = 91.1 bits (47), Expect = 7e-16 

Identities = 117/152 (76%), Positives = 117/152 (76%) 



U 5 





Query: 


1498 




Sbjct : 


18194 


M 


Query: 


1558 


i ; i 


Sbj Ct : 


18254 


Q 
n i 


Query : 


1618 




Sbjct: 


18314 



tggtgtcgtgagcagcatagagctgctctggtgtggaatgagctcagcaagattaatccc 15 5 7 

INN NIININN INI INI lllllll INN INN II II N 

tggtgccgtgagcagcaccgagcctctctcatgtggaacgagctaagcaaaatcaaccct 18 2 5 - 
acaagcgtcatgagcatgatttacaatagacccgtatcagccaaaagaataggagatgtc 1617 

II II II Mill II III I II INN Mill, llllllll 



atttcagtctctaactgtattgtggtagacca 1649 

II II llllll INN I INN II I I I 
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Figure 11 (a) 



gi I 2337975 (AF005370) glycoprotein B [Alcelaphine herpesvirus 1] 
Length =8 54 

Score = 953 bits (2437), Expect = 0.0 

Identities = 463/804 (57%) , Positives = 589/804 (72%) , Gaps = 26/804 (3%) 

Query: 74 K^IYGSPSTFPYRVCSASGVGDVTRFQTDHVCPDASDMVHSEGILLIYKQNIIPFMFRVR 13 3 

K 1+ PS FP+RVCSAS +GD+FRFQT H CP+ D H+EGILLI+K+NI+P++F+VR 
Sbjct: 55 KGIHSDPSAFPFRVCSASNIGDIFRFQTSHSCPNTKDKEHNEGILLIFKENIVPYVFKVR 114 

Query: 134 KYRKVVTTSTVTNGIYSDSITNQHTFYKSIEPWETEKMDTIYQCFNSLRIiNTGGNLLTYV 193 

KYRK+ VTTST+YNGIY+D+ + TNQH F KS + +ET +MDTIYQC+NSL + GGNLL Y 
Sbjct: 115 KYRKIWTSTIYNGIYADAVTNQHVFSKSVPIYETRRMDTIYQCYNSLDVTVGGNLLVYT 174 

Query: 194 DRDDINMTvF'LQPvTDGVTPDVTCRYGSQPELYLEPGWFWGSYRRRTTVNCEIjMDMFARSNP 253 

D D NMTV LQPVDG++ V+RY SQPE++ EPGW G YRRRTTVNCE+ + AR+ P 
Sbjct: 175 DNDGSNMTVDLQPVDGLSNSVltRYHSQPEIHAEPGWLLG 234 

Query: 254 PFDFFWATGDTV^MSPFWSGEDDHENKMHEKPWFVSVINNYK^A/DYQNRGTVPLGKTRI 313 

PF +F + T GDT+EMSPFWS + E + +V +Y+WDY+ RGT P G TRI 

Sbjct : 23 5 PFRYFITNIGDTIEMSPFWSKAWNETEFSGEPDRTLTVAKDYRWDYKFRGTQPQGHTRI 2 94 

Query: 314 FLDREEYTLSWEKHLKNMSYCPLTLWKAFYNGIQTEHSGSYHFVANDITASFTTSKEDMK 3 73 

F+D+EEYTLSW + +N+SYC WK+F N I+TEH S HFVANDITASF T + 

Sbjct: 295 FVDKEEYTLSWAQQFRNISYCRWAHWKSFDNAIKTEHGKSLHFVANDITASFYTPNTQTR 354 

Query: 3 74 EFNTTYHCLNXXXXXXXXXXXXXVTCSTHSKYGD 43 3 

E + CLN VN THS G +Y+ T+GGL LVWQPL+Q +LLDAK 

Sbjct: 355 EvXGKJWCI^^IESELKSRIJUCvTIDTHSPNGTAQYYL™ 414 

Query: 434 KLN NETYSRRSRRQAESTTDPMMEMTGNGAGGEYSSENSITVAQVQYAYDN 4 84 

L+ T + RSRRQ S + +G Y++E++I + Q+Q+AYD 

Sbjct: 415 LLDAVKKQQNTTTTTTTTRSRRQRRSVS SGIDDVYTAESTILLTQIQFAYDT 466 

Query: 485 LRIRINNILEDLSKAWCREQHRAALWNELSKINPTSVMSMIYNRPVSAKRIGDVISVSN 544 

LR + INN+LE + LS+AWCREQHRA+L+WNELSKINPTSVMS IY RPVSAKRIGDVISVS+ 
Sbjct: 467 LRAQINNVl^EELSRAWCREQHRASLMWNELSKINPTSVMSSIYGRPVSAiCRIGDVISVSH 526 

Query: 545 CIVVDQTSVSLHKSLRLLSA-SDEKCFSRPPVTFKFMNDSTIYKGQLGVNNEILLTTTYL 603 

C+WDQ SVSLH+S+R+ +C+SRPPVTFKF+NDS +YKGQLGVNNEILLTTT + 

Sbjct: 527 CVVVDQDSVSLHRSMRVPGRDKTHECYSRPPVTFKFINDSHLYKGQLGVNNEILLTTTAV 58 6 

Query: 604 ETCQENTEYYFQAXTDf4YIYKNYEHLKTVPLSSITTLDTFIAliNFTLLENVDFKVIELYT 663 

E C ENTE+YFQ +MY YKNY H+KT+P+ + TLDTF+ LN TL+EN+DF+VIELY-+- 
Sbjct: 587 EICHENTEHYFQGGNNMYFYKNYRHVTCTMPVGDVAT^ 646 

Query: 664 RDEKRLSNVTDIETMFREYNYYAQRVSGLRKDLLDLSTN^ 723 

R+EKR+S FDIETMFREYNYY QRV+GLR+DL DL+TNRNQFVDAFGSLMDDLG VG+T 
Sbjct: 647 REEKRMSTAFDIETMFREYTJYYTQRVTGLRRDLTDI^TN 706 



13/14 



o 



o 



Figure 11(b) 



Query: 724 WNAVSGVATLF SS I VTGF INF I KNPFGGMLM 1 I WTGVLFAI YFLTKKTKI YETAP I KM 78 3 

V+NAVS VATLFSSIV+G INFIKNPFGGML+ + + V+ + L +K K + P++M 
Sbjct: 707 VIoNAVSSVATLFSSIVSGIINFIKNPFGGMLLFGLIAAVVITVILLNRKAKRFAQNPVQM 766 

Query: 784 IYPEIDKLKEREGKSEIAPISEEELERIVLAMHIHQQNSHMETK TRKDPKDSI 836 

IYP+I + + + ++ PIS+ EL+RI+LAMH + + E + K T P D 

Sbjct: 767 IYPDIKTITSQREELQVDPISKHELDRIMLAMHDYHASKQPESKQDEEQGSTTSGPAD-W 825 

Query: 83 7 LTRAQNMLRKRSGYSNIiKNAESVE 860 

L +A+N+LR+R+GY LK + S E 
Sbjct: 826 LNKAKNVLiRRRAG YKPLKRTDS FE 849 



61 



f*=<e 



u 
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